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Abstract — It was shown recently that CSMA (Carrier Sense 
Multiple Access)-like distributed algorithms can achieve the 
maximal throughput in wireless networks (and task processing 
networks) under certain assumptions. One important, but ideal- 
ized assumption is that the sensing time is negligible, so that there 
is no collision. In this paper, we study more practical CSMA- 
based scheduling algorithms with collisions. First, we provide a 
Markov chain model and give an explicit throughput formula 
which takes into account the cost of collisions and overhead. 
The formula has a simple form since the Markov chain is 
"almost" time-reversible. Second, we propose transmission-length 
control algorithms to approach throughput optimality in this 
case. Sufficient conditions are given to ensure the convergence 
and stability of the proposed algorithms. Finally, we characterize 
the relationship between the CSMA parameters (such as the 
maximum packet lengths) and the achievable capacity region. 

Index Terms — Distributed scheduling, CSMA, Markov chain, 
convex optimization 



I. Introduction 

Efficient resource allocation is essential to achieve high 
utilization of a class of networks with resource-sharing con- 
straints, such as wireless networks and stochastic processing 
networks (SPN 0). In wireless networks, certain links cannot 
transmit at the same time due to the interference constraints 
among them. In a SPN, two tasks cannot be processed simulta- 
neously if they both require monopolizing a common resource. 
A scheduling algorithm determines which link to activate (or 
which task to process) at a given time without violating these 
constraints. Designing efficient distributed scheduling algo- 
rithms to achieve high throughput is especially a challenging 
task 0. 

Maximal-weight scheduling (MWS) [3] is a classical 
throughput-optimal algorithm. That is, MWS can stabilize all 
queues in the network as long as the arrival rates are within the 
capacity region. MWS operates in slotted time. In each slot, 
a set of non-conflicting links (called an "independent set", 
or "IS") that have the maximal weight (i.e., summation of 
queue lengths) are scheduled. However, implementing MWS 
in general networks is quite difficult for two reasons, (i) MWS 
is inherently a centralized algorithm and is not amenable to 
distributed implementation; (ii) finding a maximal-weighted IS 
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(in each slot) is NP-complete in general and is hard even for 
centralized algorithms. 

On the other hand, there has been active research on 
low-complexity but suboptimal scheduling algorithms. For 
example, reference JSJ shows that Maximal Scheduling can 
only guarantee a fraction of the network capacity. A related 
algorithm has been studied in [7| in the context of IEEE 
802.11 networks. Longest-Queue-First (LQF) algorithm (see, 
for example, J9), QUI, flj], CD), which greedily schedules 
queues in the descending order of the queue lengths, tends to 
achieve higher throughput than Maximal Scheduling, although 
it is not throughput-optimal in general iflOl . Reference lfT3l 
proposed random-access-based algorithms that can achieve 
performance comparable to that of maximal-size scheduling. 

Recently, we proposed a distributed adaptive CSMA (Car- 
rier Sense Multiple Access) algorithm (16| that is throughput- 
optimal for a general interference model, under certain as- 
sumptions (further explained later). The algorithm has a few 
desirable features. It is distributed (i.e., each node only uses 
its own backlog information), asynchronous (i.e., nodes do 
not need to synchronize their transmission) and requires no 
control message. (In [17], Rajagopalan and Shah indepen- 
dently proposed a similar randomized algorithm in the context 
of optical networks. Reference lfT8l showed that under a 
"node-exclusive" interference model, CSMA can be made 
throughput-optimal in an asymptotic regime.) We have also 
developed a joint algorithm in |[T6l that combined the adaptive 
CSMA scheduling with congestion control to approach the 
maximal total utility of competing data flows. 

However, the algorithms in |[T6l assume an idealized CSMA 
protocol (03), 1251 . 1261 . Il27l ). meaning that the sensing is 
instantaneous, so that conflicting links do not transmit at the 
same time (i.e., collisions do not occur). In many situations, 
however, this is an unnatural assumption. For example, in 
CSMA/CA wireless networks, due to the propagation delay 
and processing time, sensing is not instantaneous. Instead, time 
can be viewed as divided into discrete minislots, and collisions 
happen if multiple conflicting links try to transmit in the same 
minislot. When a collision occurs, all links that are involved 
lose their packets, and will try again later. 

In this paper, we study this important practical issue when 
designing CSMA-based scheduling algorithms. We follow four 
main steps: (1) we first present a Markov chain model for a 
simple CSMA protocol with collisions, and give an explicit 
throughput formula (in section (TTJ that has a simple form 
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since the Markov chain is "almost" time-reversible; (2) we 
show that the algorithms in [[161 can be extended to approach 
throughput optimality even with collisions ( section [TTTb; (3) we 
give sufficient conditions to ensure the convergence/stability of 
the proposed algorithms (section UTTt: (4) finally, we discuss 
the tradeoff between the achievable capacity region and short- 
term fairness, and we characterize the relationship between the 
CSMA parameters (such as the maximum packet lengths) and 
the achievable capacity region. 

Although step (2) can be viewed as a generalization of 
|[T6l . we believe that this generalization is important and 
non-trivial. The importance, as mentioned above, is because 
collisions are unavoidable in CSMA/CA wireless networks, 
and the collision-free model used in |16| does not provide 
enough accuracy. The generalization is non-trivial for the 
following reasons. Step (2) requires the result of step (1). In 
step (1), the Markov chain used to model the CSMA protocol 
is no longer time-reversible as in |[T6l . We need to re-define 
the state space in order to compute the service rates it can 
provide. Interestingly, as a result of our design the chain is 
"almost" time-reversible which can be exploited. In step (2), 
in view of the expression of service rates derived in step 
(1), it is important to realize that adjusting the backoff times 
as in lfl6l does not lead to the desirable throughput-optimal 
property. Instead, one should adjust the transmission lengths. 
Different from flffl . we further show that the "optimal" CSMA 
parameters (in this case the mean transmission lengths) are 
unique. This fact is needed to establish the convergence of 
our algorithm in step (3). 

We note that in a recent work (8), Ni and Srikant also 
proposed a CSMA-like algorithm to achieve near-optimal 
throughput with collisions taken into account. The algorithm 
in [H uses synchronized and alternate control phase and data 
phase. It is designed to realize a discrete-time CSMA (in 
the data phase) which has the same stationary distribution 
as its continuous counterpart in fl6l . The control phase does 
not contribute to the throughput and can be viewed as the 
protocol overhead. Different from (8), our algorithm here is 
asynchronous, and has more resemblance to the RTS/CTS 
mode in IEEE 802.11. Although the algorithm in QD is 
quite elegant and could potentially be applied to other time- 
slotted systems, we believe that it is an interesting problem 
to understand how to use the asynchronous algorithm to 
achieve throughput-optimality. Also, due to its similarity to 
the RTS/CTS mode of IEEE 802.11, the throughput analysis 
in this paper could also deepen the understanding of 802.11 
in general topologies. 



II. CSMA/CA-BASED SCHEDULING WITH COLLISIONS 
A. Model 

In this section we present a model for CSMA/CA-based 
scheduling with collisions. Note that the goal of the paper is 
not to propose a comprehensive model that captures all details 
for IEEE 802. 1 1 networks and predict the performance of such 
networks (The literature in that area has been very rich. See, 
for example, l24l . 11251 and the references therein.) Instead, at a 
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Fig. 1: Infrastructure network 
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Fig. 2: Ad-hoc network 



more abstract level, we are interested in a distributed schedul- 
ing algorithm that is inspired by CSMA/CA, and designing 
adaptive algorithms to approach throughput-optimality. 

Consider a (single-channel) wireless network. Define a 
"link" as an (ordered) transmitter-receiver pair. Assume that 
there are K links, and denote the set of links by M (then, 
K = \Af\). Without loss of generality, assume that each 
link has a capacity of 1. We say that two links conflict if 
they cannot transmit (or, "be active") at the same time due 
to interference. (The conflict relationship is assumed to be 
symmetric.) Accordingly, define G as the conflict graph. Each 
vertex in G represents a link, and there is an edge between 
two vertexes if the corresponding links conflict. Note that this 
simple conflict model may not reflect all possible interference 
structures that could occur in wireless networks. However, it 
does provide a useful abstraction and has been used widely in 
literature (see, for example, 11251 and flU). 

Fig. [11(a) shows a wireless LAN with 6 links. The network's 
conflict graph is a full graph (Fig. Q] (b)). (Circles represent 
nodes and rectangles represent links.) Fig. [2(a) shows an ad- 
hoc network with 3 links. Assume that link 1, 2 conflict, and 
link 2, 3 conflict. Then the network's conflict graph is Fig. [2] 
(b). 

Basic Protocol 

We now describe the basic CSMA/CA protocol with fixed 
transmission probabilities (which suffices for our later devel- 
opment.) Let a be the duration of each idle slot (or "minislot"). 
(<r should be at least the time needed by any wireless station 
to detect the transmission of any other station. Specifically, 
it accounts for the propagation delay, the time needed to 
switch from the receiving to the transmitting state, and the 
time to signal to the MAC layer the state of the channel lfl5l . 
The value of a varies for different physical layers. In IEEE 
802.11a, for example, a = 9fis.) In the following we will 
simply use "slot" to refer to the minislot. 

Assume that all links are saturated (i.e., always have packets 
to transmit). In each slot, if (the transmitter of) link i is not 
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already transmitting and if the medium is idle, the transmitter 
of link i starts transmission with probability pi. If at a certain 
slot, link i did not choose to transmit but a conflicting 
link starts transmitting, then link i keeps silent until that 
transmission ends. If conflicting links start transmitting at 
the same slot, then a collision happens. [We assume that the 
network has no hidden node (HN). The case with HNs will 
be discussed in Section [V-C I For possible ways to address the 
HN problem, please refer to |fl9ll and its references.] 

Each link transmits a short probe packet with length 7 
(similar to the RTS packet in 802.11) before the data is 
transmitted. (All "lengths" here are measured in number of 
slots and are assumed to be integers.) This increases the 
overhead of successful transmissions, but can avoid collisions 
of long data packets. When a collision happens, only the probe 
packets collide, so each collision lasts a length of 7. Assume 
that a successful transmission of link i lasts Tj, which includes 
a constant overhead r' (composed of RTS, CTS, ACK, etc) 
and the data payload rf which is a random variable. Clearly 
n > t'. Let the p.m.f. (probability mass function) of r, be 

Pr{n = b i } = Pi{bi),VbieZ+ + (1) 

and assume that the p.m.f. has a finite support, i.e., Pj(&i) = 
0, Vbi > b max > O0Then the mean of r, is 

T i :=E(r i )= £ b-P t {b). (2) 

bez ++ 

Fig. [3] illustrates the timeline of the 3 -link network in Fig. 
|2] (b), where link 1 and 2 conflict, and link 2 and 3 conflict. 

We note a subtle point in our modeling. In IEEE 802.11, 
a link can attempt to start a transmission only after it has 
sensed the medium as idle for a constant time (which is 
called DIFS, or "DCF Inter Frame Space"). To take this into 
account, DIFS is included in the packet transmission length 
Ti and the collision length 7. In particular, for a successful 
transmission of link i, DIFS is included in the constant 
overhead r'. Although DIFS, as part of t', is actually after 
the payload, in Fig. [3] we plot r' before the payload. This is 
for convenience and does not affect our results. So, under this 
model, a link can attempt to start a transmission immediately 
after the transmissions of its conflicting links end. 

The above model is almost time-reversible such that a 
simple throughput formula can be derived. A process is 
"time-reversible" if the process and its time-reversed process 
are statistically indistinguishable JTJ . Our model, in Fig. [3] 
reversed in time, follows the same protocol as described above, 
except for the order of the overhead and the payload, which are 
reversed. A key reason for this property is that the collisions 
start and finish at the same time. (This point will be made 
more precise in Appendix A.) 

B. Notation 

Let the "on-off state" be a; £ {0,1} X where x k , the k- 
th element of x, is such that x k = 1 if link k is active 

'The finite support assumption is not a restrictive one, since in practical 
wireless networks there is usually an upper bound on the packet size. However, 
the CSMA/CA Markov chain defined later is still ergodic even without the 
assumption. 
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Fig. 3: Timeline in the basic model (In this figure, Ti = Ti,i 
1,2,3 are constants.) 
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Fig. 4: An example conflict graph (each square represents a 
link). In this on-off state x, links 1, 2, 5 are active. So S(x) = 
{5}, Z(x) = {1,2}, h(x) = 1. 



(transmitting) in state x, and Xk — otherwise. Thus, x is 
a vector indicating which links are active in a given slot. Let 
G(x) be the subgraph of G after removing all vertices (each 
representing a link) with state (i.e., any link j with Xj = 0) 
and their associated edges. In general, G(x) is composed of 
a number of connected components (simply called "compo- 
nents") C m (x),m = 1,2,..., M(x) (where each component 
is a set of links, and M(x) is the total number of components 
in G(x)). If a component C m (x) has only one active link (i.e., 
\C m (x)\ = 1), then this link is having a successful transmis- 
sion; if |C m (x)| > 1, then all the links in the component 
are experiencing a collision. Let the set of "successful" links 
in state x be S(x) := {k\k 6 C m (x) with |C m (a;)| = 1}, 
and the set of links that are experiencing collisions be Z(x). 
Also, define the "collision number" h(x) as the number of 
components in G{x) with size larger than 1. Fig. @] shows an 
example. Note that the transmissions in a collision component 
C m {x) are "synchronized", i.e., the links in C m (x) must have 
started transmitting in the same slot, and will end transmitting 
in the same slot after 7 slots (the length of the probe packets). 

C. Computation of the service rates 

In order to compute the service rates of all the links under 
the above CSMA protocol when all the links are saturated, we 
first define the underlying discrete-time Markov chain which 
we call the CSMA/CA Markov chain. 

The Markov chain evolves slot by slot0 The state of the 
Markov chain in a slot is 



w := {x, ((6 fc ,a fe ),Vfc : x k = 1)} 



(3) 



where b k is the total length of the current packet link k is 
transmitting, a k is the remaining time (including the current 

2 For the ease of analysis, we make the modeling assumption that the links 
are synchronized at the slot level. 
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Fig. 5: Example of the CSMA/CA Markov chain 



slot) before the transmission of link k ends. 
For example, in Fig. [5] the state w and w' are 

w = {x = (1, 0, 1) T , (6i = 11, ax = 1), (63 - 10, a 3 = 4)} 

(4) 

and 



!/ = {s=(0,0,lr,(&3 = 10,^ = 3)} 



Note that in any state w as defined in (01, we have 

(I) 1 < a k < b kl Vk : x k = 1. 

(II) P k (b k )>Q,VkeS{x). 

(III) If k e Z(x), then b k = 7 and a k G {1,2,..., 7}. 
An important observation here is that the transmissions in a 
collision component C m {x) are "synchronized", i.e., the links 
in C m (x) must have started transmitting at the same time, 
and will end transmitting at the same time, so all links in the 
component C m (x) have the same remaining time. (To see this, 
first note that in the case of a collision only the probe packets 
get transmitted, and their transmission times 7 are identical 
for all links. Second, any two links i and j in this component 
with an edge between them must have started transmitting 
at the same time. Otherwise, if i starts earlier, j would not 
transmit since it already hears i's transmission; and vice versa. 
By induction, all links in the component must have started 
transmitting at the same time.) So, we can write a k = aS m ' 
for any k G C m (x) where |C m (a;)| > 1, and denotes the 
remaining time of the component C m (x). 

We say that a state w is valid iff it satisfies (I)~(III) above. 

Since the transmission lengths are always bounded by b max 
by assumption, we have b k < b max , and therefore the Markov 
chain has a finite number of states, and is ergodic. As detailed 
in AppendixlAl a nice property of this Markov chain is that it is 
"almost" time-reversible. As a result, its stationary distribution 
has a simple product-form (Appendix [A), from which the 
probability of any on-off state x can be computed: 

Theorem 1: Under the stationary distribution, the probabil- 
ity of x G {0, 1}^ in a given slot is 



p(x) 



n n) n (I-?.) n % 

k£S(x) i-.Xi=0 j-Xj=l 

= ^h h{x) n r*)n*T9*" x< (6) 

kes(x) i=i 
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where := 1 — pi, Tj is the mean transmission length of link 
i (as defined in (0)), and E is a normalizing term such that 



The proof is given in Appendix AQ 

Remark: Note that in x, some links can be in a collision 
state, just as in IEEE 802.11. This is reflected in the 
term in @. Expression (O differs from the idealized-CSMA 
case in |[T6l and the stationary distribution in the data phase 
in the protocol proposed in |8|], due to the difference of the 
three protocols. 

Now we re-parametrize T k by a variable r k . Let T k := t' + 
To • exp(rfc), where r', as we defined, is the overhead of a 
successful transmission (including RTS, CTS, ACK packets, 
DIFS, etc.), and Tf := Tq ■ exp(r k ) is the mean length of 
the payload. Tq > is a constant "reference payload length". 
Let r be the vector of r k 's. By Theorem the stationary 
probability of a; in a slot (with a given r) is 

Pi^) = 1^9( x )- II (r' + T -exp(r k )) (7) 

kes(x) 



E(t)- 



(5) where g(x) = 7 



h(x 



nlxPi 



l) Xi is not related to r, and the 



normalizing term is 

E(t)= Yl W)- II (r' + T -exp(r fc ))]. (8) 

x'e{o,i} K kes(x') 

Then, the stationary probability that link k is transmitting a 
payload in a given slot is 

T ■ exp(r fe ) 



Sfe(r) 



r' + T ■ exp(r fc ) 



(9) 



x-.kes(x) 



Recall that the capacity of each link is 1. Also, it's easy 
to show that the CSMA/CA Markov chain is ergodic. As a 
result, if r is fixed, the long-term average throughput of link 
k converges to the stationary probability s k (r). So we say that 
Sfc(r) G [0, 1] is the service rate of link k. 

III. A Distributed algorithm to approach 

THROUGHPUT-OPTIMALITY 
A. The scheduling problem 

Assume that the conflict graph G has different indepen- 
dent sets ("IS", not confined to "maximal independent sets"), 
where each IS is a set of links that can transmit simultaneously 
without conflict. Denote an IS by a G {0, 1}^, a 0-1 vector 
that indicates which links are transmitting in this IS. The fc'th 
element of a, a k = 1 if link k is transmitting in this IS, and 
a k = otherwise. Let X be the set of ISs. 

We now describe the scheduling problem which is the 
focus of the paper. Traffic arrives at link k with an arrival 
rate Aft G (0, 1). For simplicity, assume the following i.i.d. 
Bernoulli arrivals (although this can be easily generalized): at 
the beginning of slot M ■ i, (i = 0, 1, 2, . . . ), a packet with a 

3 In this paper, several kinds of "states" are defined. With a little abuse of 
notation, we always use p(-) to denote the probability of the "state" under the 
stationary distribution of the CSMA/CA Markov chain. This does not cause 
confusion since the meaning of p( ) is clear from its argument. 

4 In 1201 , a similar model for CSMA/CA network is formulated with analogy 
to a loss network [211 . However, since 1201 studied the case when the links are 
unsaturated, the explicit expression of the stationary distribution was difficult 
to obtain. 
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length of M slots arrives at link fc with probability A^. (That 
is, the packet would take M slots to transmit.) Clearly, link fc 
needs to be active with a probability at least A^ to serve the 
arrivals. Denote the vector of arrival rates by A e 72-+ . 

Since we focus on the scheduling problem, all the packets 
traverse only one link (i.e., single-hop) before they leave the 
network. However, the results here can be extended to multi- 
hop networks and be combined with congestion control as in 

EES). 

Definition 1: We say that A is feasible iff it can be written 

as A = Y,aex\P° ' a ] where P° - and ^*ex V° = 1 - That 
is, there is a schedule of the independent sets (including the 

non-maximal ones) that can serve the arrivals. Denote the set 

of feasible A by C. We say that A is strictly feasible iff A e C, 

where C is the interior of C. 

A scheduling algorithm is said to be "throughput-optimal" if 
it can "support" any A <E C. In this paper, this means that for 
any A £ C, the scheduling algorithm can provide to link k a 
service rate at least X k for all k. 

B. CSMA scheduling with collisions 

The following theorem states that any service rates equal to 
A 6 C can be achieved by properly choosing the mean payload 
lengths T k := T exp(r k ), Vfc. 

Theorem 2: Assume that 7, r' > 0, and transmission prob- 
abilities pk G (0,1), Vfc are fixed. Given any A e C, there 
exists a unique r* £ TZ K such that the service rate of link fc 
is equal to the arrival rate for all fc: 

Sfe (r*) = A fe ,Vfc. (10) 

Moreover, r* is the solution of the convex optimization 
problem 

maxi(r;A) (11) 

r 

where 

L(r;A)=^(A fe r fc )-log(£(r)), (12) 

fc 

with E(r) defined in (0. This is because <9L(r; X)/dr k = 
At - s fe (r), Vfc. 

The proof is in Appendix B. 
Theorem [2] motivates us to design a gradient algorithm to 
solve problem (fTTT i. However, due to the randomness of the 
system, X k and s k (r) cannot be obtained directly and need to 
be estimated. We design the following distributed algorithm, 
where each link fc dynamically adjusts its mean payload length 
Tf based on local information. 

Algorithm 1: Transmission length control algorithm 

The vector r is updated every M slots. Specifically, it is 
updated at the beginning of slot M ■ i, i = 1,2,.... Denote 
by ti the time when slot M ■ i begins. Also define to = 0. 
Let "period i" be the time between tj_i and i,;, and r(i) be 
the value of r at the end of period i, i.e., at time Initially, 
link fc sets r fc (0) G [r min ,r max } where r mm ,r max are two 

5 That is, C := {A' G C\B{\' , d) C C for some d > 0}, where B(A', d) = 
{A| 1 1 A — A' 1 1 2 < d} is a ball centered at A' with radius d. 



parameters (to be further discussed). Then at time ti, i = 
1,2,..., each link fc updates 

r fc (i) = r fc (i - 1) + a(i)[A' fc (i) - s' k (i) + h(r k (i - 1))] (13) 

where h(-) is a "penalty function" to be defined later, a(i) > 
is the step size in period i, X' k (i), s' k (i) are the empirical 
average arrival rate and service rate in period i (i.e., the actual 
amount of arrived traffic and served traffic in period i divided 
by M). 

The use of dummy bits: An important point here is that 
we let link fc add dummy bits to the payload when its queue 
has less bits than what is specified by the algorithm (e.g., in 
(fT~5T > below). If the queue is empty, then dummy packets are 
transmitted with the specified size. So, each link is saturated. 
This ensures that the CSMA/CA Markov chain has the desired 
stationary distribution in ©. The transmitted dummy bits are 
also included in the computation of s' k (i). (Although the use 
of dummy bits consumes bandwidth, it simplifies our analysis, 
and does not prevent us from achieving the primary goal, i.e., 
approaching throughput-optimality. In Section IV-BI we also 
simulate the case without dummy bits.) 

Note that \' k (i), s' k (i) are random variables which are gener- 
ally not equal to X k and s k (r(i — 1)). Assume that the maximal 
instantaneous arrival rate is A, so X' k (i) < X,\/k,i. 

Also, in ( fT3] l. the penalty function h(-) is defined as 

{Train V rt ?/ <C Tmin 
if y £ [r m in,r ma x] (14) 

^max V if 1/ Trnax • 

As shown in the Appendix, this function keeps r(i) in a 
bounded region. (This is a "softer" approach than directly 
projecting r k (i) to the set [r min ,r max ]. The purpose is only 
to simplify the proof of Theorem [3] later.) 

In period i + 1, given r(i), we need to choose t? (i), the 
payload lengths of each link fc, so that -E(r? (i)) = T^{i) = 
T exp(r k (i)). If T k (i) is an integer, then we let r£(i) = 
T k (i); otherwise, we randomize T k (i) as follows: 

*n = / [T " (l)1 with probability T " W - L T " «J 

Tfc ^ \ LZ£(i)J with probability \T%(i)] - 2*(i). 

(15) 

Here, for simplicity, we have assumed that the arrived packets 
can be fragmented and reassembled to obtain the desired 
lengths |~Tf(£)] or |_^T(*)J- However, one can avoid the frag- 
mentation by randomizing the number of transmitted packets 
(each with a length of M slots) in a similar way. When there 
are not enough bits in the queue, "dummy bits" are generated 
(as mentioned before) to satisfy E(r k (i)) = Tq exp(r k (i)) and 
make the links always saturated. 

Intuitively speaking, Algorithm 1 says that when r k e 
[Tmin: r max], if the empirical arrival rate of link fc is larger than 
the service rate, then link fc should transmit more aggressively 
by using a larger mean transmission length, and vice versa. 

Algorithm 1 is parametrized by r m i n , r max which are fixed 
during the execution of the algorithm. Note that the choice 
of r max affects the maximal possible payload length. Also, as 
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discussed below, the choices of r max and r m , n also determine 
the "capacity region" of Algorithm 1. 
We define the region of arrival rates 

C{r mini r max ) := {A G C|r*(A) G {r min ,r max ) K } (16) 

where r*(A) denotes the unique solution of max r L(r;A) 
(such that Sfc(r*) = Afc,Vfc, by Theorem |2j. Later we show 
that the algorithm can "support" any A G C(r m i n ,r max ) in 
some sense under certain conditions on the step sizes. We 
will also give a characterization of the region C(r m i n ,r max ) 
later in section HVl 

Clearly, C(r min , r max ) ->■ C as r mm -> -oo and r mox ->■ 
oo, where C is the set of all strictly feasible A (by Theo- 
rem |2). Therefore, although given (r m i n ,r max ) the region 
C(r TOin , r miM ) is smaller than C, one can choose (r 
to arbitrarily approach the maximal capacity region C. Also, 
there is a tradeoff between the capacity region and the maximal 
packet length. 

Theorem 3: Assume that the vector of arrival rates A G 
C(r min ,r max ). With Algorithm 1, 

(i) If a(i) > is non-increasing and satisfies J^i a W = 00 > 
J2i a ("i) 2 < 00 an d — 1 (f° r example, a(i) = then 
r(i) — > r* as i — > oo with probability 1, where r* satisfies 
s fc (r*) = A fc ,Vfc. 

(ii) The case with constant step size (i.e., a(i) = a, Vi): 
For any <5 > 0, there exists a small enough a > such that 
liminf j^oo J^ii s l-(*)/^] — Afc — <5, Vfc with probability 1. In 
other words, one can achieve average service rates arbitrarily 
close to the arrival rates by choosing a small enough. 
Remark: In (8) which proposed an alternative algorithm to 
deal with collisions, the authors made an idealized time- 
scale-separation assumption that the CSMA/CA Markov chain 
reaches its stationary distribution for any given CSMA param- 
eters. We believe that the results in Theorem[3]can be extended 
to their algorithm. 

The complete proof of Theorem [3] is Appendix C, but the 
result can be intuitively understood as follows. If the step size 
is small (in (i), a(i) becomes small when i is large), r k is 
"quasi-static" such that roughly, the service rate is averaged 
(over multiple periods) to s k (r), and the arrival rate is averaged 
to Afe. Thus the algorithm solves the optimization problem 
(fTTT i by a stochastic approximation |23l argument, such that 
r(i) converges to r* in part (i), and r(i) is near r* with high 
probability in part (ii). 

Corollary 1: Consider a variant of Algorithm 1 below 
where the update equation of each link k is 

rfc(i) = r k (i-l)+a{i)[\' k {i)+A-s' k (i)+h(r k (i-l))] (17) 

with a small constant A > 0. That is, the algorithm "pretends" 
to serve the arrival rate A + A • 1 which is slightly larger than 
the actual A. Assume that 

A G C (^minj ^maii ^) 

:= {A|A + A • 1 G C{r mini r max )}. 

For algorithm (fTTI i. one has the following results: 
(i) if a(i) > is non-increasing and satisfies a(i) = oo, 
^,-a(i) 2 < oo and a(l) < 1 (for example, a(i) = 1/i), then 



r(i) —> r* as i — > oo with probability 1, where r* satisfies 
s fc (r*)=A fe +A>A fc ,Vfc; 

(ii) if a(i) = a (i.e., constant step size) where a is small 
enough, then all queues are positive recurrent (and therefore 
stable). 

Algorithm dTvT > is parametrized by r m i n ,r max and A. 
Clearly, as r„ im — > — oo, r max — > oo and A — ► 0, 
C'(r m i n , r max , A) — > C, the maximal capacity region. 

The proof is similar to that of Theorem [3] and is given 
in ll30l . A sketch is as follows: Part (i) is similar to (i) in 
Theorem [3] The extra fact that s k (r* ) > X k , Vfc reduces 
the queue size compared to Algorithm 1 (since when the 
queue size is large enough, it tends to decrease). Part (ii) 
holds because if we choose S = A/2, then by Theorem [3] 
liminf j^oo J] 4 J = i sU*)/^] > A fc + A - <5 > A fc ,Vfc almost 
surely if a is small enough. Then the result follows by showing 
that the queue sizes have negative drift. 

IV. Relationship between the CSMA parameters 

AND THE CAPACITY REGION 

In the previous section, we mentioned that the region 
C(r min ,r m ax) (and C'(r rnin , r max , A)) becomes larger as 
we decrease r mm and/or increase r max . Therefore, fixing 
r mm , a larger r max leads to a larger capacity region, but 
allows for larger transmission lengths. In practice, however, 
the transmission lengths should not be too long, since longer 
transmission lengths lead to larger access delay (where the 
access delay refers to the time between the beginnings of 
two consecutive successful transmissions of a link) and larger 
variations of the delay, and consequently, poorer short-term 
fairness. It is especially the case when a link has a number 
of conflicting links which do not interfere with each other. 
Then the link has to wait for all the conflicting links to 
become inactive before attempting its transmission. This issue 
has been studied in |26||22||27| in the contexts of 1-D 
and 2-D lattice topologies, and star topologies, where it is 
shown that the short-term fairness worsens when the access 
intensities (i.e., the ratios between the average transmission 
times and mean backoff times) increas^l. Although references 
1 26 1 [22 1 1 27 1 focus on the collision-free idealized-CSMA, we 
observe the same phenomenon in the simulations of our model 
(see Appendix E for some simulation results in the 1-D and 
2-D lattice topologies). 

Therefore, there is a tradeoff between the long-term effi- 
ciency (i.e., the capacity region) and short-term fairness. To 
quantify the tradeoff we need to understand two relationships. 
The first is the relationship between the maximal required 
transmission lengths and the capacity region. And the second 
is between the maximal transmission lengths and the short- 
term fairness. 

We first discuss the second relationship. For simplicity, 
assume the arrival rate vector is A, and that Algorithm 1 
has converged to the suitable mean pay load lengths Tf := 
T exp(r£(A)),Vfc (Recall that r*(A) is the vector such that 

6 Reference 1261 also showed that when the access intensities are high, 
there exists long-term unfairness in the 2-D lattice topology under different 
boundary conditions. 
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Sfc(r*(A)) = Afc,Vfc). Assume that we fix the mean payload 
lengths at Tf's, and denote the (random) access delay of link 
k by D k . We use two quantities to measure the short-term 
fairness of link k: the mean and standard deviation of D k 
(i.e., E(D k ) and a/ var(D k )). Similar to El . one has 

rpp 

E(D k ) = 



rpp 

«k(r*(A)) A,' • 



Therefore if we can find an upper bound of Tf? = 
To exp(r£(A)) (to be further discussed in this section), then an 
upper bound of E(Dk) can be obtained. (In fact, in Algorithm 
1, the long-term average of D k is also T^/X k , since the initial 
convergence phase is not significant in the long term.) On the 
other hand, obtaining an expression of y/var{D k ) for general 
topologies is difficult and deserves future research. (In Section 
IV-AI we will present some numerical results.) Therefore, how 
to choose r max to ensure that s Jvar{D k ) is lower than some 
threshold remains an open problem. 

Next we consider the first relationship. We present several 
generic bounds to characterize how the regions C(r min ,r max ) 
and C'(r min ,r max , A) depend on r max and r min . Given a 
A G C, by the definition of C(r m i n ,r max ) in (fToT ). if one 
chooses r m i n < minfcr^(A) and r max > max^r^A), then 
A G C(r m i n , r max ), so that Algorithm 1 can be used to support 
A. (r£(A) is the k-th element of r*(A).) A similar statement 



can be made for C'(r 



7mn : ' max i 



A). 



Consider a vector A >- which is at the boundary of C 
(Le., A G C but pX £ C,Vp > 1). Clearly, for p e (0,1), 
pX G C. Denote p = 1 — e. We are interested to bound 
r*((l — e)A). For the idealized CSMA model without collisions 
used in |[T6l . an earlier bound obtained in [31] (Lemma 8-(3)) 
suggests that maxfcr£((l — e)A) < 0(l/e) (where r* there 
controls the backoff times). In this section, we show a stronger 
result that, in our model with collisions, maxfc r£((l — e)A) < 
0(log(l/e)), so that the required r max to support arrival rates 
( 1 — e) A is not more than O (log ( 1 /e) ) . (Also, one can similarly 
show that the same order 0(log(l/e)) applies to the idealized 
CSMA model as well.) 

Theorem 4: We have 

i TV' 

A T r*((l - e )A) < b ■ [log(-) + log( — ) + 2G + 1] (18) 

for some constants N', b and G (defined during the proof), if 
e<l/b. When e G (1/6,1), 



A T r*((l - e)A) < [log(JV') + 2G]/e. 



(19) 



So roughly speaking, as e — >• 0, the value of r*(A) is not 
more than 0(log(l/e)) by ( fTSI ). 
The proof is in Appendix D. 

The following is a lower bound of r*(A). 

Proposition 1: Given any A G C, we have 



r*(A)>log( 



X,, 



Therefore 



mmr fe (A) > log( — ; — ). 

k T 1 — minfeAfc 



(20) 



(21) 




Fig. 6: The conflict graph in simulations 



Proof: Suppose that r^(A) < log( 1 ^ ), then the mean 
payload length is T exp(r^(A)) < r' • Afc/(1 - X k ). Note that 
the overhead of each successful transmission is r'. So, even if 
link k is successfully transmitting all the time, its service rate 



l-A 



-J{T> + 



l-A 



:) = X k 



would be strictly less than r' 
leading to a contradiction. 
Then, we have the following result. 

Corollary 2: max fc r*((l - e)A) < 0(log(l/e)) as e -> 0. 
Proof: For convenience, denote A = (1 — e)A. Since we 
are interested in the asymptotic behavior as e — > 0, assume 
that e < 0.5. Denote X m in '■= niinj; Aft > 0. Then, A^ > 
0.5A m j n , \/k. 

By <|20K we know that r*(A) > log(^) + log( T ^) > 
log(^)+log( 1 ° 5 ^" ) := r,Vk. Then, combined with 



if e < 1/6, we have for any fc, 
rt(X) < {fe.[log(^) + 2G + l] 



< {6.[log(A) + 2 G 
o • e 



Z! ~ Xk ' 

k'^k 



rft'(A)}/A fc 



ft'^ft 



Aft' • r}/X k 



= 0(log(l/e)) 
which completes the proof. ■ 

V. Numerical results 

Consider the conflict graph in Fig. [6] Let the vector 
of arrival rates be A = p ■ A, where p G (0,1) is the 
"load", and A is a convex combination of several maximal 
IS: A = 0.2 * [1, 0, 1, 0, 1, 0, 0] + 0.2 * [0, 1, 0, 0, 1, 0, 1] + 
0.2 * [0,0,0,1,0,1,0] + 0.2 * [0,1,0,0,0,1,0] + 0.2 * 
[1, 0, 1, 0, 0, 1, 0] = [0.4, 0.4, 0.4, 0.2, 0.4, 0.6, 0.2]. Since p G 
(0, 1), A is strictly feasible. Fix the transmission probabilities 
as p k = 1/16, Vfc. The "reference payload length" To = 15. 

A. Transmission length control algorithms 

We evaluate algorithm (T% (a variant of Algorithm 1) in 
our C++ simulator. The update in ( fTTI i is performed every 
M = 500 slots. Let the step size a(i) = 0.23/(2 + a'/ 100), 
the upper bound r max = 3.5, the lower bound r m - m = 0, and 
the "gap" A = 0.005. The initial value of each r k is 0. 

Let the "load" of arrival rates be p = 0.8 (i.e., A = 0.8 • A). 
The collision length (e.g., RTS length) is 7 = 5, and the 
overhead of successful transmission is r' = 10. To show 
the negative drift of the queue lengths, assume that initially 
all queue lengths are 300 data units (where each data unit 
takes 100 slots to transmit). As expected, Fig. [7] (a) shows 
the convergence of the mean payload lengths, and Fig. [7] (b) 
shows that all queues are stable. 
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Fig. 7: Simulation of Algorithm ( fTTI i (with the conflict graph 
in Fig. [6) 



p 


0.65 


0.7 


0.75 


0.8 


0.85 


Mean of D3 (in slots) 


125.9 


146.7 


178.3 


226.3 


310.3 


Standard deviation of D3 


132.7 


161.9 


211.5 


293.7 


456.1 



TABLE I: Short-term fairness of link 3 





Hi 


R2 


R3 


Ra 


Rs 


Re 


= 0.15 (Simulation) 


0.279 


0.386 


0.547 


0.548 


0.387 


0.279 


9 = 0.15 ([28 1) 


0.272 


0.347 


0.442 


0.442 


0.347 


0.273 


= 0.2 (Simulation) 


0.526 


0.837 


1.372 


1.371 


0.840 


0.526 


= 0.2 (1281) 


0.5 


0.75 


1.125 


1.125 


0.75 


0.5 


= 0.25 (Simulation) 


1.075 


2.229 


4.735 


4.733 


2.240 


1.072 


= 0.25 CI 28 1) 


1 


2 


4 


4 


2 


1 


= 0.3 (Simulation) 


3.210 


12.94 


52.76 


52.32 


12.91 


3.209 


= 0.3 ([28]) 


3 


12 


48 


48 


12 


3 



TABLE II: Comparison of access intensities 



To study the tradeoff between the load p and 
short-term fairness, we run Algorithm (fTTI i for 
p <= {0.65,0.7,0.75,0.8,0.85}. In each case, we collect 
the data of the access delay and compute its mean and 
standard deviation when r has almost converged. Table J] 
shows the results for link 3 (and other links have a similar 
trend). Note that when p increases, both the standard deviation 
and the mean increase, and their ratio increases too, indicating 
poorer short-term fairness. 

In 11281 . van de Ven et al. considered the line topology 
(i.e., 1-D lattice topology) and obtained the explicit expression 
of the access intensity of each link required to support a 
uniform throughput 9 for all the links, under the idealized- 
CSMA model without collisions El, |25], J26), J27j. For 
comparison, we simulate Algorithm 1 in a line topology with 
6 links, where each link conflicts with the first 2 links on both 
sides. After Algorithm 1 converges, we compute the access 
intensity of link k as Rk '■= Tj!/(l/pk — 1) (since the mean 
backoff time of link k is 1/pk — 1), and compare it to the 
result of Theorem 2 in ll28l (although the "access intensities" 
under the two models are not completely equivalent due to 
our inclusion of collisions.) Let pk = 1/16, Vfc, 7=1 and 
t' = 1. We simulate four sets of arrival rates, A = 9 • 1 where 
9 = 0.15,0.2,0.25 and 0.3, and give the results in Table HH 

The results show a close match, with relatively larger 
differences in R3 and R4. The reason is that link 3 and 
4 are in the middle of the network and suffer from more 
collisions. After each collision link 3 (or 4) needs to restart 
the backoff, which increases its effective backoff time and 
therefore requires a larger payload length to compensate. Also, 
all Rk's are higher in the simulation due to collisions and the 
overhead r'. 

B. Effect of dummy bits 

We have used dummy bits to facilitate our analysis and 
design of the algorithms. However, transmitting dummy bits 
when a queue is empty consumes extra bandwidth. In this 
subsection, we simulate our algorithms without dummy bits. 

In both Algorithm 1 and Algorithm ( fTTI i. we make the 
following heuristic modification. For each link fc, if tf (i) as 
computed in <TT~5T > is larger than the current (positive) queue 
length, then transmit a packet that includes all the bits of the 
queue as the payload. That is, no dummy bits are added. If the 
queue is empty then the link keeps silent. In the computation 
of s' k (i), however, the payload of the packet is counted as 
surprisingly, the modified algorithms are difficult 
to analyze, and we therefore do not claim their convergence. 
(However, they still seem to converge in the simulations.) 

Fig. [8] shows the evolution of the average payload lengths 
under Algorithm ( fTTI i without dummy bits, when p = 0.8. 
Indeed, the required payload lengths are significantly reduced 
compared to Fig. |7J (a) due to the saved bandwidth. 

Under Algorithm 1, however, we find that the required 
payload lengths are very close with or without dummy bits. 

7 The reason for this design is that, if we only count the actual bits 
tran smitted, then Algorithm (T7J could not converge. Indeed, if Algorithm 
H7\ converges, then the average service rates is strictly larger than the arrival 
rates, which is impossible if we only count the actually transmitted bits. 
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Fig. 9: Service rates in a 2-link network with hidden nodes 



Fig. 8: Algorithm ( fPTI i without dummy bits 



The reason is that, since Algorithm 1 only tries to make the 
average service rates equals the arrival rates, the queues do not 
have a negative drift towards zero. As a result, the queues are 
not close to zero most of the time, so dummy bits are rarely 
generated even if they are allowed. 

C. Effect of hidden nodes 

So far we have assumed that there is no hidden node (HN) 
in the network. In this subsection we discuss the effect of HNs. 

Consider a simple network with 2 links that are hidden 
from each other. That is, they cannot hear the transmissions of 
each other but a collision occurs if their transmissions overlap. 
Unlike the case without HNs, a link can start transmitting 
in the middle of the other link's transmission and cause a 
collision. 

First, to explore how much service rates can be achieved in 
this scenario, we let the two links use the same, fixed payload 
length T p. Let 7 = 5,r' = 10, and p k = 1/64, Vfc. The 
two links receive the same service rate by symmetry. Fig. [9] 
shows the service rate of one link under different values of 
t p . Note that the maximal service rate per link is about 0.12, 
much less than 0.5 in the case without HNs. Also, when t p is 
large enough, further increasing t p decreases the service rates, 
because larger packets are more easily collided by the HN. 

Then we simulate Algorithm 1 with arrival rates Ai = A2 = 
0.1 < 0.12. We set T = 15,r mm = 0,r max = 2.59 so 
that the maximal payload is T exp(r ma2; ) = 200 (slots), and 
a(i) = 0.14/(2 + i/100). Unlike the case without HNs, the 
results depend on the initial condition as shown in Fig.[l0] For 
example, if the initial payload lengths of both links are 40 slots 
(which we call "initial condition 1"), then the mean payload 
lengths converge to the correct value (about 17.5). However, 
if the initial payloads are 80 slots ("initial condition 2"), then 
the mean payload lengths keep increasing (until reaching the 
maximal value) and cannot support the arrival rates. This can 
be explained by Fig. [9] Initial payload lengths of 40 slots 
achieve a per-link service rate higher than the arrival rate. By 
Algorithm 1, the payload lengths are decreased and eventually 
converge to the correct values. However, if initially the payload 
lengths are 80 slots, a per-link service rate lower than 0.1 is 
achieved. By Algorithm 1, both links increase their payload 
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Fig. 10: Algorithm 1 with hidden nodes 



lengths. This, however, further decreases their service rates, 
and the cycle goes on. The root cause for this behavior is as 
follows. Algorithm 1 has implicitly used the fact that, without 
HNs, a link's service rate increases with its payload length. 
However, it may not be the case when HNs exist. 

To sum up, in the presence of HNs, both the achievable 
capacity region of CSMA and the property of our algorithms 
have changed. To address the HN problem, there are at 
least two directions to explore. The first is to understand 
the achievable capacity with HNs, and design algorithms to 
achieve the capacity. The second is to design protocols to 
remove or reduce HNs, so that our existing algorithms can be 
applied. There have been many proposals aiming to remove 
or reduce the HNs (see fl9l and the references therein). 



VI. Conclusion 

In this paper, we have studied CSMA-based scheduling 
algorithms with collisions. We first provided a model and 
gave a throughput formula which takes into account the 
cost of collisions and overhead. The formula has a simple 
product form. Next, we designed distributed algorithms where 
each link adaptively updates its mean transmission length to 
approach the throughput-optimality, and provided sufficient 
conditions to ensure the convergence and stability of the 
algorithms. We also characterized the relationship between 
the algorithm parameters and the achievable capacity region. 
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Finally, simulations results were presented to illustrate and 
verify the main results. 

In the algorithm, the transmission probabilities of the links 
are chosen to be fixed at a reasonable level, since we have 
shown that adjusting the transmission lengths alone is suffi- 
cient to approach throughput-optimality (the main goal of this 
paper). However, the choices of the transmission probabilities 
Pk's has an effect on the probability of collisions among the 
probe packets. In the future, we would like to further study 
whether the adjustment of transmission probabilities can be 
combined with the algorithm. Also, we are interested to further 
explore the short-term fairness and the case with hidden nodes. 

Appendix 

A. Proof of Theorem [7] 

First, the stationary distribution of the CSMA/CA Markov 
chain is expressed in the following lemma. 

Lemma 1: In the stationary distribution, the probability of 
a valid state w as defined by (0 is 



p(w) 



^ n » n \prf(hJ^)\ (22) 



where 



1 if j £ Z(x) 

P 3 {b ) ifjeS(x) 



(23) 



where Pj(bj) is the p.m.f. of link j's transmission length, 
as defined in (HJ. Also, E is a normalizing term such that 
J2 w p( w ) = 1' i- e -' a ^ probabilities sum up to 1. Note that 
p(w) does not depend on the remaining time a k 's. 

Proof: For a given state w = {x, ((b k , a k ), Vfc : x k = 1)}, 
define the set of active links whose remaining time is larger 
than 1 as 

Ai(w) = {k\x k = l,a k > 1}. 

Links in Ai(w) will continue their transmissions (either with 
success or a collision) in the next slot. 

Define the set of inactive links "blocked" by links in A\ (w) 

as 

dA\{w) = {j\xj = 0; e(j, k) = 1 for some k £ Ai(w)} 

where e(j, k) = 1 means that there is an edge between j and 
k in the conflict graph. Links in dA\{w) will remain inactive 
in the next slot. 

Write Ai(w) := Ai(w) U dAi(w). Define the set of all 
other links as 

A 2 {w) =N\Ai(w). (24) 

These links can change their on-off states Xfc's in the next slot. 
On the other hand, links in Ai(w) will have the same on-off 
states x k 's in the next slot. 

To illustrate these notations, consider the example in Fig. 
By ©, we have Ai{w) = {$},dA x {w) = {2},A 1 {w) = 
{2,3} and A 2 (w) = {1}. 

State w can transit in the next slot to another valid state 
w' = {x',{{b' k ,a' k ),Vk : x' k = 1)}, i.e., Q(w,w') > 0, if 
and only if w' satisfies that (i) x' k = x k ,\/k £ Ai(w); (ii) 



a 'k = 6fc, V& £ A 2 (w) such that x' k = 1, and b' k = 7,Vfc £ 
A 2 (w)nZ(x'). (If A 2 (w) is an empty set, then condition (iii) 
is trivially true.) The transition probability is 

Q(w,w')= [] \pi-fKhx')T' i 'k~ X ' i - (25) 

i£A 2 (w) 

Q(w',w):= J] \pi-f(b u i,x)T^- Xi . (26) 



Define 



(If A 2 {w) is an empty set, then Q(w, w') = 1 and Q(w' , w) := 
1.) If w and w' does not satisfy all the conditions (i), (ii), (iii), 
then Q(w,w') = 0, and also define Q(w',w) = 0. 

Then, if Q(w,w') > (and Q(w',w) > 0), 
p(w)/Q(w',w) = |^ i ^ 2W [ft•/( & t ,^)] I '9!" I •■ And 
p(w')/Q(w,w>) = in^ 2 »bi • fib'^x 1 )]*'^. But 
for any i ^ A 2 {w), i.e., i £ Ai(w), we have x[ = Xi, b\ = bi 
by condition (i), (ii) above. Therefore, the two expressions are 
equal. Thus 



p(w)Q(w, w') = p(w')Q(w' , w). 



(27) 



If two states w, w' satisfy Q(w,w') = 0, then by definition 
Q(w' , w) = 0, making (f2Tb trivially true. Therefore, d2~7b holds 
for any w and w'. 

We will show later that Q(w' , w) is the transition probability 
of the "time-reversed process" of the above Markov chain 
(notice the similarity between Q(w,w') and Q(w',w)), and 
naturally satisfies J2 W Qi w> \ w ) — 1- Assuming that the claim 
is true, then by ( f27l ). we have 



2^p(w) ■ Q(w,w') =p(w') 



Q{w',w) =p(w'). 



Therefore, p(w) is the stationary (or "invariant") distribution, 
which completes the proof of Lemma Q] 

It remains to be shown that the above claim is true, in 
particular, that J2 W Q{w',w) = 1. Denote the orignal process 
as {w(t),t £ Z} (this is the Markov process that describes our 
CSMA protocol, with transition probabilities Q(-, •) in d25ll). 
Adding a time index in (f3]), we have 



w(t) := {x(t), ((&*(*), a fc (i)),Vfc : x k (t) = 1)}. 



(28) 



bi 



l,Vfc £ Ai{w) such that x k = 1; (iii) 



Now define the time-reversed process w(t) := w(—t),yt £ 
Z. First, note that in the process {w(t)}, the remaining time 
a,k{t), if defined, decreases with i; in the reversed process 
{w(t)}, however, a,k(—t) increases with t. Therefore, {w(t)} 
and {w(t)} are, clearly, statistically distinguishable. So {w(t)} 
is not time-reversible. 

However, if we re-label the "remaining time" in the reversed 
order, then the process {w(t)} "looks a lot" like the process 
{w(t)}. (This is why we say the CSMA/CA Markov chain is 
almost time-reversible.) More formally, with the understanding 
that w = {x, ((bk,ak),Vk : x k = 1)}, define a function g(-) 
as 

g(w) = {x, {{b k ,b k -a k + 1), Vfc : x k = 1)}. (29) 
Then define the process 

w(t) :=g(w(-t)). 
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Note that in the process {w(t)}, the "remaining time" de- 
creases with t, similar to {w(t)}. 

Next we show the following two facts. 

Fact 1: For any two states w and w' with Q(w,w') > 
(i.e., if the CSMA Markov chain can transit from state w to 
state w'), we have Ai(w) = Ai(g(w')), Ai(w) = Ai(g(w')) 
and A 2 (w) = A 2 {g(w')). 

Fact 2: Q(w, w') > o- Q{g{w'), g{w)) > 0. 

These facts can be illustrated by the example in Fig. 
First consider Fact 1. Note that Ai(w) = {3}, by definition, 
is the set of links that are in the middle of a transmission in 
state w and will continue the transmission in the next state 
w'. Then, in the reversed process, such links are also in the 
middle of a transmission in state g(w') and will continue the 
transmission in the next state g(w). So Ai(w) = A\{g{w')). 
Similarly, dA\{w) = {2}, the set of links that are blocked by 
A\{w) in w are also blocked by Ai(g(w')) in the reversed 
process. Therefore dA\(w) = dA\(g(w')). Then by d24l i. we 
have A2(w) = A2(g(w')). (Note that it is not difficult to prove 
Fact 1 mechanically via the definitions of A±(-),A2(-). But we 
omit it here.) 

One can also verify Fact 2 in Fig. [5] We now give a 
more formal proof. If Q(w,w') > 0, then w and w' satisfy 
conditions (i)~(iii). We first show that Q(g(w'), g(w)) > 0. 
To this end, we need to verify that the states g(w') and 
g(w) satisfy condition (i)~(iii). Condition (i) holds because 
Ai(w) = Ai(g(w')) by Fact 1, and because g(-) does not 
change the "on-off state" of its argument. Condition (ii) 
holds since g(-) has reversed the remaining time (cf. d29]>). 
Condition (iii) requires that in the reversed process, any 
link k G A2(g(w')) which is transmitting in the state g(w) 
must have just started its transmission. This is true because 
A 2 (g(w')) = A 2 (w) by Fact 1, and that in the original process 
w(t), any link k G A 2 (w) which is transmitting in state w 
must be in its last slot of the transmission (otherwise the link 
would be in Ai(w)). Then condition (iii) holds since g(-) has 
reversed the remaining time. 

This completes the proof that Q(w, w') > => 
Q{g(w'),g{w)) > 0. Now, if Q{g{w'),g(w)) > 0, by the 
above result, we have Q(g(g(w)), g(g(w'))) > 0. Since 
g(g(w)) = w,g(g(w')) = w', we have Q(w,w') > 0. This 
completes the proof of Fact 2. 

Consider two states w and w 1 with Q(w, w') > 0. Then 

Q(g(w'),g(w)) > 0, with 

Q(g(w'),g(w))= ]J fa ■ f(b h i, x)]^ 1 "** 

ieA 2 (g(w<)) 

by (1251 . Using (|26| | and A 2 (w) = A2(g(w')), we have 
Q{w',w) = ]J \Pi-f(h,i,x)rql- x < 

i£A2(w) 

II \Pi-f(bi,i,x)) Xi ql- Xi 

i£A 2 (g(w')) 

= Q(g(w'),g(w)). (30) 

Therefore, Q(w',w) is the transition probability of the 
reversed process. 



By definition, Q(w',w) = for any w, w' satisfying 
Q(w,w') = 0. So, given w', 

w w.Q(w,w')>0 

J2 Q(g(w'),g(w)) 

w:Q(w,w')>0 

]T Q(g(w'),g(w)) 

w,Q(g(w'),g(w))>0 

= J2Q(g(w'),g(w)) = l 

w 

where the last step has used the fact that g(-) is a one-one 
mapping, so that the summation is over all valid states. ■ 

Using Lemma [U the probability of any on-off state x, as in 
Theorem Q] can be computed by summing up the probabilities 
of all states w's with the same on-off state x, using (1221) . 

Define the set of valid states B{x) := 
{w\ the on-off state is x in the state w}. By Lemma Q] 
we have 

V(x) = ^ p(w) 

weB(x) 

- ! e * n * n \prm,j,x)}} 

weB(x) i:xt=0 j-Xj=l 

= |( n * n pi) e n f 

i:Xi=0 j:xj=l wGB(x) j:Xj=l 

= |( n * n e [ n cd 

i:x i= j-Xj=l wGB(x) jeS(x) 

Now we compute the term J2 w eB(x)[Ujes(x) P jQ>j))- Con- 
sider a state w = {x, ((bk, a,k), Vfc : Xk = 1)} G B(x). For 
k G S(x), bk can take different values in Z ++ . For each 
fixed bk, a,k can be any integer from 1 to bk- For a collision 
component C m (x) (i.e., |C m (x)| > 1), the remaining time of 
each link in the component, a^ 71 ', can be any integer from 1 
to 7. Then we have 

e t n Wi)] 

weB{x) jes(x) 

- n e e n ( e !) 

jeS(x) bj l<aj<bj m:\C m (x)\>l l< a ('")< 7 

= n E^^^)]-7 h(a;) 

jes(x) 6^ 

= ( II T jh Hx) - (32) 
Combining OTt and d32l completes the proof. 

B. Proof of Theorem [2] 

1) Some definitions: If at an on-off state x, k G S(x) (i.e., 
k is transmitting successfully), it is possible that link k is 
transmitting the overhead or the payload. So we define the 
"detailed state" (x,z), where z G {0, 1} K . Let z k = 1 if 
k G S(x) and link k is transmitting its payload (instead of 
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overhead). Let z k = otherwise. Denote the set of all possible 
detailed states (x, z) by S. 

Then similar to the proof of Theorem Q] and using equation 
©, we have the following product-form stationary distribution 

p((x, z);r) = ^^ysO, z) ■ exp(^] z k r k ) (33) 



where 



g{x,z) = g(x) ■ ( r ')ls(*)|-i'z T( i' z 



(34) 



where l'z is the number of links that are transmitting the 
payload in state (x, z). 

Clearly, this provides another expression of the service rate 
Sfc(r): 

Sk(r) = P((x,z);r). (35) 

(l,2)ES:2 t =l 

Now we give alternative definitions of feasible and strictly 
feasible arrival rates to facilitate our proof. We will show that 
these definitions are equivalent to Definition [TJ 

Definition 2: (i) A vector of arrival rate A E 1Z+ (where K 
is the number of links) is feasible if there exists a probability 
distribution p over S (i.e., E(i z )gsP(( 2; i z )) = 1 an d 
p((x, z)) > 0), such that 



(36) 



(i,z)es 



Let Ceo be the set of feasible A, where "CO" stands for 
"collision". 

The rationale of the definition is that if A can be scheduled 
by the network, the fraction of time that the network spent 
in the detailed states must be non-negative and sum up to 1. 
(Note that ( f36l > is the probability that link k is sending its 
payload given the distribution of the detailed states.) 

(ii) A vector of arrival rate A <G 1Z+ is strictly feasible if 
it can be written as ( f36b where 2~2( x z)esP~(( x > z )) = ^ an< ^ 
p((x, z)) > 0. Let Ceo be the set of strictly feasible A. 

For example, in the ad-hoc network in Fig. [2] (b), A = 
(0.5, 0.5, 0.5) is feasible, because d36t holds if we let the 
probability of the detailed state (x = (1,0, l),z = (1,0,1)) 
be 0.5, the probability of the detailed state (x = (0, 1, 0), z = 
(0, 1,0)) be 0.5, and all other detailed states have probability 
0. However, A = (0.5, 0.5, 0.5) is not strictly feasible since 
it cannot be written as (f36t where all p((x,z)) > 0. But 
A' = (0.49, 0.49, 0.49) is strictly feasible. 

Proposition 2: The above definitions are equivalent to Def- 
inition Q] That is, 



Ceo 
Ceo 



= C 
= C. 



(37) 
(38) 



Proof: We first prove OTb . By definition, any A 6 C 
can be written as A = 2~2<jex P<? (J where X is the set 
of independent sets, and p = {pa)<jex is a probability 
distribution, i.e., p a > 0, 2~2aexP<? = Now we construct 
a distribution p over the states (x, z) £ S as follows. Let 
p{{cr, u)) = Pa^o & X, and let p{{x,z)) = for all other 
states (x, z) E S. Then, clearly J2( x z)es p(( x ' z ^ ' z = 



E CT G2f£ , (( <7 ' <7 )) ' a = H<jaxP° a = A ' which implies that 
A E Ceo- So, 

CCCco- (39) 

On the other hand, if A € Ceo- then A = 
2~2(x z)gsP(( x ' z )) ' z f° r some distribution p over S. 
We define another distribution p over X as follows. Let 

P<? = Y,( x ,z)eS: Z =aP(( X ^ Z ))y (J G X - Then ' A = 

2~2( x , z )esP(( x > z )) ■ z = Y,aexZZ(x, z )es:z=<jP(( x i z )) a = 
2~2<jgx Pc a > which implies that A € C. Therefore 



Ceo C C. 



(40) 



Combining $39$ and (gDJ yields d37t . 

We defined that C is the interior of C. To prove d38l , we 
only need to show that Ceo is a lso the interior of C. The proof 
is similar to that in Appendix A of [32), and is thus omitted. 

■ 

2) Existence of r* : Assume that A is strictly feasible. 
Consider the following convex optimization problem, where 
the vector u can be viewed as a probability distribution over 
the detailed states (x,z): 

max u {H(u)+ ^ i u (x,z) ■ \og(g(x 7 z))}} 

(x,z)eS 

s.t. Y u (x,z) = A fc ,Vfc 



(x,z)e<S:2fc = l 

U(x,z) > 0, Y ' 



Hx,z) 



1 



(41) 



where H(u) := E(x,z)es[ _u K^) io s( u (x,z))] is the "en- 
tropy" of the distribution u. 

Let r k be the dual variable associated with the constraint 
Y.{x,z)as-.z k =i u {x,z) = and let the vector r := (r k ). We 
will show the following. 

Lemma 2: The optimum dual variables r* (when problem 
(l4TT i is solved) exists, and satisfy ( 1101 , i.e., s k (r*) = Afc,Vfc. 
Also, the dual problem of ( BTt is ( [Til . 

Proof: With the above definition of r, a partial Lagrangian 
of problem gD (subject to u {x ^ z) > 0,T,( x , z ) u (x,z)eS = 1) 
is 



£(u;r) 

(x,z)ES 
k (x,z)£S:z k =l 

Y i u (x,z) l- l °s( u (x,z)) + log(g(x, z)) 

(x,z)GS 



(42) 



So 

dC(u; r) 



du 



(x,z) 



\og(u {XiZ) ) -l + \og(g{x,z))+ Y r k- 

k:z k =l 



We claim that 

(r) ■=P((x,z);r),V(x, z) E S (43) 
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(cf. equation d33l l) maximizes £(u; r) over u subject to 
— ®>2~2i x z) u (x.z)es = !■ Indeed, the partial derivative 



u (x,z) 

at the point u(r) is 

d£(u(r); r) 



du 



(x,z) 



log(£(r)) - 1, 



which is the same for all (x, z) G 5 (Since given the 
dual variables r, log(£7(r)) is a constant). Also, ur x z ^(r) = 
p((x,z);t) > and J2( x ,z)es u (*,z)( r ) = L Therefore, it is 
impossible to increase £(u; r) by slightly perturbing u around 
u(r) (subject to l T u = 1). Since £(u; r) is concave in u, the 
claim follows. 

Denote l(r) — max u £(u;r), then the dual problem of 
(l4TT i is inf r /(r). Plugging the expression of u/ x>z \(r) into 
£(u; r), it is not difficult to find that inf r Z(r) is equivalent 
to sup r £(r; A) where L(r; A) is defined in (V2\ . 

Since A is strictly feasible, it can be written as d36l l where 

J2( x z)gsP(( x > z )) = 1 an ^ P(( x ' z )) > 0- Therefore, there 
exists u >- (by choosing u = p) that satisfies the constraints 
in ( |4TT > and also in the interior of the domain of the objective 
function. So, problem ( HTT i satisfies the Slater condition 12. As 
a result, there exists a vector of (finite) optimal dual variables 
r* when problem (HTt is solved. Also, r* solves the dual 
problem sup r L(r; A). Therefore, sup r L(r; A) is attainable 
and can be written as max r L(r; A), as in (fTTT i. 

Finally, the optimal solution u* of problem (f4TT > is such that 
u \x z) = u (a;,jz)( r *),V(x, z) G S. Also, u* is clearly feasible 
for problem (|4TT >. Therefore, 



E 



s fe (r*) = A fc ,Vfc. 



(i,z)£S:zt=l 



Remark: From d42b and d43l) . we see that a subgradient (or 
gradient) of the dual objective function L(r; A) is 

dL(r;X) 



dr k 



= A fe - 



E 

(i,z)eS:zi,= 



*(z,z)( r ) = A fc - S fe( r )- 



This can also be obtained by direct differentiation of L(r; A). 

Uniqueness of r* : Now we show the uniqueness of r*. 
Note that the objective function of ( f4TT > is strictly concave. 
Therefore u*, the optimal solution of ( f4TT > is unique. Consider 
two detailed state (efc,efc) and (efc,0), where is the K- 
dimensional vector whose fc'th element is 1 and all other 
elements are 0's. We have u*, ek _> = p((e k , e k ); r*) and 
u U,o) =P((e* J 0);r*). Then by'©, 

W(e fc: e fc )( r *)/ u (e fc: o)(r*) = exp(r£) • (T /r'). 



(44) 



Suppose that r* is not unique, that is, there exist rj 7^ Tjj 
but both are optimal r. Then, r| fc 7^ rjj fe for some fc. This 
contradicts ( f44l > and the uniqueness of u*. Therefore r* is 
unique. This also implies that max r L(r; A) has a unique 
solution r*. 

C. Proof of Theorem \3\ 

We will use results in l23l to prove Theorem [3] Similar 
techniques have been used in ll22l to analyze the convergence 
of an algorithm in lfl6l . 



1) Part (i): Decreasing step size: Define the concave func- 
tion 



H(y) r- 



-(r„ 



y f/2 if y < r„ 
if y E [r. 
- y) 2 /2 if y > r n 



mtn ■> ' max] 



(45) 



Note that dH(y)/dy = h(y) where h(y) is defined in 
fl4j. Let G(r; A) := L(r; A) + J2 k H{r k ). Since A is strictly 
feasible, max r L(r;A) has a unique solution r*. That is, 
L(r*;X) > i(r;A),Vr ^ r*. Since r* G {r min ,r max ) K by 
assumption, then Vr, ^ fe H{r^) = > J^k H{ r k)- Therefore, 
G(r*; A) > G(r; A), Vr ^ r*. So r* is the unique solution of 
max r G(r; A). Because 9G(r; X)jdr k = \k — Sfe(r) + h(rk), 
Algorithm 1 tries to solve max r G(r; A) with inaccurate gra- 
dients. 

Let v s (t) be the solution of the following differential 
equation (for t > s) 



dv k {t)/dt = A fe - s k (v(t))+h(v k (t))yk 



(46) 



with the initial condition that v s (s) = f(s). So, v s (t) can 
be viewed as the "ideal" trajectory of Algorithm 1 with the 
smoothed arrival rate and service rate. And (l46T i can be viewed 
as a continuous-time gradient algorithm to solve max r G(r; A). 
We have shown above that r* is the unique solution of 
max r G(r; A). Therefore v s (t) converges to the unique r* for 
any initial condition. 

Recall that in Algorithm 1, r(i) is always updated at the 
beginning of a minislot. Define Y(i — 1) := (s' k (i) , Wo(i)) 
where wo(i) is the state w at time ij. Then {Y(i)} is a non- 
homogeneous Markov process whose transition kernel from 
time to tj depends on r(i — 1). The update in Algorithm 
1 can be written as 

r k (i) = r k (i - 1) + a(i) • [f(r k (i - 1), Y(i - 1)) + M(i)) 

where f(r k (i - l),Y(i - 1)) := X k - s' k {i) + h{r k (i - 1)), 
and M(i) = X' k (i) — X k is zero-mean noise. 

To use Corollary 8 in page 74 of l23l to show Algorithm 
l's almost-sure convergence to r*, the following conditions 
are sufficient: 

(i) /(•,•) is Lipschitz in the first argument, and uniformly in 
the second argument. This holds by the construction of h(-); 

(ii) The transition kernel of Y(i) is continuous in r(i). This 
is true due to the way we randomize the transmission lengths 
in O- 

(iii) d46b has a unique convergent point r*, which has been 
shown above; 

(iv) With Algorithm 1, r k (i) is bounded Vfc, i almost surely. 
This is proved in Lemma [3] below. 

(v) Tightness condition ((f) in 1231 . page 71): This is 
satisfied since Y(i) has a bounded state-space (cf. conditions 
(6.4.1) and (6.4.2) in J23], page 76). The state space of Y(i) 
is bounded because s' k (i) G [0, 1] and wo(i) is in a finite set 
(which is shown in Lemma H} below. 

So, by ||231 . r(i) converges to r* almost surely. 

Lemma 3: With Algorithm 1, r(i) is always bounded. 



Specifically, r k (i) £ 



2, TV, 



2A], Vfc, i, where A, 
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as defined before, is the maximal instantaneous arrival rate, 
so that X' k (i) < X,\/k,i. 

Proof: We first prove the upper bound r max + 2X by 
induction: (a) r k (Q) £ r max < r^ax + 2A; (b) For i > 1, if 
r k (i - 1) € [r max + \ L r m ax + 2A], then h(r k (i - 1)) < -A. 
Since X' k (i) - s' k (i) < A, we have rk(i) < r k (i — 1) < 
2A. If r fe (i - 1) € (r mm ,r mQX _+ A), then fe(rjfc(i - 1)) < 0. 
Also since X' k (i) — s' k (i) < A and a(i) < l,Vi, we have 
Tk{i) < r k (i-l) + X-a(i) + 2A. If r fe (i-l) < 

' mm? 

then 



= r fc (* - 1) + a(i)[A' fc (i) - 4(0 + h(r h (i - 1))] 

< r fc (i - 1) + a(i){\ + [r min - r k (i - 1)]} 
= [1 - a(i)] • r fc (i - 1) + a(i){A + r mi „} 

< [1 - a(i)] ■ r mm + a(i){\ + r min } 
= r mm + a(i) • A 



< A 



2A. 



The lower bound r m i n — 2 can be proved similarly. ■ 
Lemma 4: In Algorithm 1, Wo(i) is in a finite set. 

Proof: By Lemma [3] we know that r k (i) < r max + 
2X,Vk,i, so T?(i) < T exp(r mox + 2\),Vk,i. By Q3J, we 



have T^(i) < ToCxp(r 



■ 2A) + 1, Vfc, i. Therefore, in state 



w (i) = {x, ((6 fc ,a fc ),Vfc : x k = 1)}, we have b k < b max for 
a constant b max and a k < b k for any k such that Xfe = 1. So, 
u>o(i) is in a finite set. ■ 
2) Part (ii): Constant step size: The intuition is the same as 
part (i). That is, if the constant step size is small enough, then 
the algorithm approximately solves problem max r G(r; A). 
Please refer to l30l for the full proof. 



D. Proof of Theorem [4] 

Let N' be the number of detailed states (x, z)'s, and 
u e 1Z+ be a probability distribution over the detailed states. 
For convenience of notation, we use i = 1, 2, . . . , N' to index 
the detailed states. Then it,, the i-th element of u, is the 
probability of the i-th detailed state. Let V be the set of 
N' -dimensional probability distributions, i.e., V := {u' 6 
7?.+ '|l T u' = 1} . Also define a K x N' matrix A where 
the element A kt i = 1 if link k is transmitting its payload in 
the i-th detailed state, and A k: i = otherwise. Then, A ■ u is 
the vector of achieved throughputs of the K links under the 
distribution u. 

Lemma 5: Given A at the boundary of C, there exists a 
constant b > such that the following holds: For any < e < 
1, if u S V and A ■ u = (1 — e)A, then one can find u 6 P 
such that A ■ u = A and ||u — u||i < 2b ■ e. 

Proof: Consider the set U := {u|u G V,A ■ u = p ■ 
A for some < p < 1}. Clearly hi is a poly tope since Z// 
is defined by linear constraints. Denote by £(U) the set of 
extreme points of U. Then, if u £ U and satisfies A ■ u = 
(1 — e)A, we have 



u = 



(47) 



where a y > and X) y e£(W) a y = ^ so ' by definition, for 
any y £ £(U), A ■ y = p y X for some p y G [0, 1]. Then, 



a-u= ( a v A ■ y) = P" x 

where p u = J2 y e£(U)( a yPy) = 1 ~ e - 
For any u' G W, define 



D(u') 



z i 



s.i. A-z = A. 



(48) 



It can be shown that D(u') is a convex function of u'. (Proof: 
Consider any two u' 7 and u' 7/ . When u' = u' 7 (or u J7 ), 
suppose zj (or zjj) is an optimal solution of problem d48l i. 
That is, D(u'j) = \\u'j — zj\\i and D(u'jj) = ||u J7 — zu\\i. 
Given any /3 G [0, 1], we have 

(3 ■ D(Wj) + D(u' n ) 
= ^|[u^ - zj-IU + (1 - ^)||u' ZJ - z/j-Hi 
> -zj) + (l- ( 8)K J -z JJ )|| 1 

= |puj + (1 - /3)u 7/ ] - [/3z 7 + (1 - /3)z // ]|| 1 .(49) 

Now consider D(^u' 7 + (1 - /3)u' n ). Clearly, z := /3zj + 
(1 — /3)zjj satisfies the constraint A ■ z = A. So, D(/3u'j + 

(i-P)u' n ) < m/K + a-^Kji-^zj + 

Combined with (@9), we have D(^u 7 + (1 - /3)u' 7/ ) < /3 • 
D(u' I )+(l— ,S)-D(uj 7 ), which implies that -D(u') is a convex 
function of u'.) 

Using this fact and ( f4~Tb . we have 



I»(u)< ^ (oyI>(y)) = 



yef(w) y e£(W). Py <i 



(a y £>(y)) (50) 



where the second step is because if p y = 1, then D(y) = 0. 
Define 



I ye£(U).py<l 



(51) 



Since there are a finite number of elements in £(U), we have 
< b < +oo. Then 

ye£{U), Py <l 

< 2b- J2 i^-^-Py)} 

ye£(W),p y <l 

= 2b- K-^-Py)) 

ye£(U) 
= 2b[l - p u ] = 26e. 

Let u be a solution of (l48l > with u' = u, then ||u — u||i = 
D(u) < 2be. ■ 
Lemma 6: Assume that u, u G V and u ^ u. If ||u — 

u||i < 2c for a constant c G (0, 1], then \H(u) - H(u)\ < 

c ■ [\og(N'/c) + 1], where H(u) := E^'ih^ Iog(«i)] is ^ 
"entropy" of the distribution u. 

Proof: Let (u — u) + and (u — u)~ be the positive part 
and negative part of u — u. Then clearly ||u — u||i = ||(u — 
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u) + 1 1 1 + 1 1 (u — u) Also, since l T u = l T u = 1, we have 
|| (u — u) + ||i = ||(u — u)~||i. Therefore, 

0<||(u-u)+|| 1 = ||(u-u)-|| 1 = i||u-u|| 1 <c. 

Let the index sets X\ — {i\i G {1,2, N'},Ui > u^} 
and X 2 = {i\i G {1,2,..., N'}, u ?; < u t }. Define N x := 
\li\,N 2 := |X 2 |. Clearly Nx + N 2 < N', and N U N 2 > 0. 

Now we show that H(u) - H(u) < c ■ [\og(N'/c) + 1}. 
Define the function f(x) := x\og(l/x). First, for i G I±, 
denote 5i = Ui — iii > 0, then 1 1 (u — u) + ||i = J2ieii ^- ^ ote 
that Ui,Ui,5i G [0, 1]. Since f(x) is concave, with ui > ui > 
0, we have f( Ui ) - /(«*) < f(5 f ) - /(0) = f(5i). So 

Ei/K) -/(*)]<£/(*) 
< £/( „,. ) 



■ ,(»-a)+|k 

ll(u-u)+|| 1 log( 1 



< ||( U -u)+|| 1 log( 1 



(U-U) + ||!- 

N' 



(52) 



(u-u)+||/ 

where the second inequality follows from the fact that f(x) 
is concave and J^iei = ll( u — u) + 1 1 i . 

It is easy to show that f(x) is increasing in the range x G 
[0, exp(— 1)]. Also, we have N' > 3 (even in the smallest 
one-link network). Thus ||(u - u)+||i/iV' < c/N' < 1/N' < 
exp(— 1). Therefore, from 021 . 

£[/(«*)-/(*)] < ^-/( ll(u Ar fl)+I ' 1 ) 

iGXi 

< iV'-/(^ 7 )=clog(^).(53) 

Since /(.t) is concave, we have /(u,) — /(w,*) < (uj — 
Ui)f'{ui). For i G since iti < itj and f'(ui) > —1 for any 
u, G (0, 1], we have 



< 



iex 2 



»6l a 



(uj - Ui) < c. (54) 



Using d53]> and 0J]|, we conclude that H(u) - H(u) < 
c ■ [log(iV'/c) + 1]. The same argument shows that H(u) — 
H(u) < c ■ [log(iV'/c) + 1]. Therefore the lemma follows. ■ 
Now we are ready to prove Theorem |U 

As explained before, r*(A) is the vector of optimal dual 
variables in the optimization problem (t4lT) . simply written as 



V(X) := maxve-p ^( v ) + / ,9i v i 

i 

s.t. A-v = X 



(55) 



where g^,i = 1,2, ...,N' denote the constants \og(g(x, z)) 
for (x, z) G S. Let u* be the optimal solution when A = 
(1 - e)A. Then V((l - e)A) = H(u*) + J2 l9l u*. 

Consider the case when e < 1/6. By Lemma [5] there exists 
ueV such that A ■ u = A and ||u* - u||i < 26 • e < 2. By 
Lemma |6] 

\H(u*) - £T(u)| < 6 • e[logi) + 1]. (56) 

6 • e 




1 | 1 2j [3j [4^ [5j [6j [7J [8j [9j 10 

Fig. 11: Conflict graph of a line network 

Since QeP satisfies A • u = A, we have V(X) > H(u) + 
J2i 9iUi- Also, the "value function" V(A) is concave in A (121 
page 250), and satisfies W(\) = — r*(A). Therefore 

H(u) + J29iUi < V(X) 

i 

< V((l - e)A) + (A - (1 — e)Af [-r*((l - e)A)] 
= ff(u*)+E5i<-e-A T r*((l-e)A). 

i 

So, 

e-A T r*((l-e)A) <H(u*)-H(u) + E^«-^)- (57) 

i 

Denote G := max^ then 

< G^|< < 2(76e. (58) 

Combining (ED, ([58) and ®, we have e-A T r*((l-e)A) < 
b ■ e[log(^) + 1] + 2G6e, which proves (HQ. 

For the second case when e > 1/6, choose an arbitrary u G 
V such that A-u = A. Clearly, | |u* - u||i < 2, so the RHS of 
d58) can be replaced by 2G. Also, |F(u*) -i?(u)| < log(TY') 
since if(-) G [0, log(AT')]. Using these in d57]l yields dT9b . 

£. Simulations of 1-D and 2-D lattice topologies 

Consider a line network (i.e., 1-D lattice network) with 16 
links, where each link conflicts with the nearest 2 links on 
each side (so, link k conflicts with 4 other links if it is not at 
the two ends of the network), as in Fig.Qj] Let 7 = 5, t' = 10 
and pk = 1/16, Vfc. In each simulation, we let all links use 
the same, fixed payload length T p = 30, 50, 100, 150, and we 
compute the "short-term throughput" of link 8 (in the middle 
of the network) every 50 milliseconds (or 50/0.009 w 5556 
slots). That is, we compute the average throughput of link 8 
in each time window of 50 milliseconds. 

Note that a successful transmission has a length of r' + T P . 
Also note that if the parameter r max in Algorithm 1 satisfies 
that To exp(r max ) = T p , then it is possible that all links 
transmit payload T p . 

The results are plotted in Fig. [T2] We see that the oscilla- 
tion in the short-term throughput increases as T p increases, 
indicating worsening short-term fairness. 

Next we simulate a 2-D lattice network with 5*5=25 links in 
Fig. [13] Each link conflicts with the nearest 4 links around it (if 
it's not at the boundary). Similar to the previous simulation, we 
use different values of T p and obtain the following short-term 
throughput of link 13 (which is at the center of the network) 
in Fig. [14] Again we observe wrosening short-term fairness 
as T p increases. Also, the oscillation of link 13's short-term 
throughput is greater than link 8 in the line network, since 
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Fig. 12: Short-term throughput of link 8 in the line network 



link 13 has 4 conflicting neighbors which do not conflict with 
each other. 

However, as mentioned before, it remains an open problem 
to exact characterize the relationship between short-term fair- 
ness (in particular the standard deviation of the access delay) 
and T p (or equivalently, r max ) in general topologies. We are 
interested to further explore useful methods to quantify the 
relationship. 
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